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Abstract. We illustrate a process that constructs martingales from raw ma- 
terial that arises naturally from the theory of sampling without replacement. 
The usefulness of the new martingales is illustrated by the development of 
maximal inequalities for permuted sequences of real numbers. Some of these 
inequalities are new and some are variations of classical inequalities like those 
introduced by A. Garsia in the study of rearrangement of orthogonal series. 
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1. Introduction: A Motivating Question 

Let X = {x\, X2, x n } be a fixed set of real numbers, and let X\ 1 X%, X n 
be the successive values of a sample of size n that is drawn sequentially without 
replacement from the set X. We are concerned here with a systematic process by 
which one can construct martingales with respect to the sequence of sigma-fields 
a(Xi,X2, ...,Xk), 1 < k < n. For example, we consider the partial sums 

S k =X 1 +X 2 + --- + X k , l<k<n, 

and ask: Is there an {Tk : 1 < k < n} martingale where the values {S% : 1 < k < n} 
appear in an simple and explicit way? What about the values : 1 < k < n}, 
or the partial sums of Xf, etc. We show that there is a practical, unified approach 
to these problems. It faces some limitations due to the burden of algebra, but one 
can make considerable progress before those burdens become too cumbersome. 

We first illustrate the construction with two basic examples. These lead in turn 
to several martingales whose usefulness we indicate by the derivation of permutation 
inequalities — both old and new. 



2. First Example of the Construction 

To begin, we consider 2/. = X\ + Xf + • • • + X% for 1 < k < n, and we ask for 
a martingale where T k (or a deterministic multiple of T^) appears as a summand. 
The set X is known before sampling beings, and the only source of randomness is 



Date: October 30, 2012. 

Vladimir Pozdnyakov: Department of Statistics, University of Connecticut, 215 Glenbrook Road, 
U-4120, Storrs, CT 06269-4120, Email: vladimir.pozdnyakovOuconn.edu. 

J. M. Steele: The Wharton School, Department of Statistics, Huntsman Hall 447, University of 
Pennsylvania, Philadelphia, PA 19104. Email address: steeleSwharton.upenn.edu. 

1 



POZDNYAKOV AND STEELE 



the sampling process itself. The totals are known, deterministic, values which we 
denote by 

(1) M = S n and B = T n . 

We first compute the conditional expectation of S\ +1 given the immediate past, 
E[Sl +l \T k ]=E[(S k + X k+1 f\T k ] 



E[(Sl + 2S k X k+1 + Xl +l \T k ] 
M-S k B-T k 
n — k n — k 



= S k + 2S k 



n - k - 2 ^ 2 2M 



n — k 



n — k 



1 nn B 

T k + 



n — k 



n — k' 



To organize this information, introduce the vector-column £ k = (S k , S k , T k , 1) T , 
and note that wc also have 



E[S k+ i\F k ] = S k H — = — — S k + 



n — k 



n — k 



-Epfc+llJ"*;] —T k + — ^- — " ~ T k + 



n — k 
B 



and 



n — k 



n — k 



n — k 



These observations can be combined into one matrix equation for the one-step 
conditional expected values 

EKfe+il-fife] =M+i£ k , 
where here the deterministic 4x4 matrix A k+ i is given explicitly by 



Ah+i = 



n-k-2 

n — k 





V 











2M 
n — k 

n — k — 1 
n — k 






B 



n—k n—k 
M 



\ 







n — k 
n-k-1 B 



n — k 




n — k 
1 



The matrices {A k : 1 < k < n — 2} are invertible and deterministic, so the vector 
process 



(2) 



M k = A^A^--.A^i k 



is well-defined and adapted to {T k : 1 < k < n — 2}. To check that it is a vector 
martingale we only need to note that 

(3) E{M k+1 \T k ) =E{A^A^.-.All^ k+1 \T k ) 
=A^ 1 A^.-.A- k l l E{i k+1 \F k ) 



=A- 1 1 A^ 1 ■ ■ ■ A^ +l A k+1 i k - W • • • A-^ k 



Mi 



Now, to extract the benefit from the martingale {M k }, one just needs to make it 
more explicit. 
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Here one is fortunate; an easy induction confirms that A 1 1 A 2 1 --- A k 1 is given 
by the upper-triangular matrix: 



/ n(n-l) 
n — k — 1 






2knM 



n — k — 1 n 



kn 



k(k + 1)M 2 - knB \ 



n — k — 1 
-kM 

-kB 



V 







/ 



Now, the coordinates of the vector martingale M & = (M\ t k, M^ki ^3,ki M4 tk ) T arc 
martingales in their own right, and it is worthwhile to examine them individually. 

The fourth coordinate just gives the trivial martingale = 1, but the other 
coordinates are much more interesting. The second and third coordinates give us 
two useful — but known — martingales, 



(4) 



M 2 , k = (nS k - kM)/(n - k) and M 3jfc = (nT k - kB)/{n - k). 



Actually, we have only one martingale here; one gets the martingale {M^} from 



the martingale {M 2}k \ if one replaces X with the set of squares X' = {xf, 



b 2i ■ 



■ i x n } • 



The martingale {M 2 , fc } is given in Serfling (1974) and Stout (1974, p. 147). The 
earliest source we could identify for the martingale {il^fc} is Garsia (1968, p. 82), 
but it is hard to say if this was its first appearance. The martingale {Af 2 ,fc} is a 
natural one with few impediments to its discovery. 

To get a martingale that is more novel (and less transparent), we only need 
to consider the first coordinate of the vector martingale M^. One can write this 
coordinate explicitly as 



M 



i,fc 



n(n — 1) 



2knM 



(n - k)(n - k- 1) 
kn 

(n — k)(n — k — 1) 



-S k 



(n — k)(n — k — 1) 
k(k + 1)M 2 - knB 
(n — k)(n — k — 1) 



This may seem complicated at first sight, but there is room for simplification. In 
many situations it is natural to assume that M = S n — 0, and, in that case, 
reduces to the more manageable martingale sequence that we denote by 



(5) M k 



1 



(n — k)(n — k — 1) 



[(' 



1)52 



(B - T k )k] 



1 < k < n- 2. 



We will give several applications of this martingale to permutation inequalities, 
and, in particular, we use it in Section[6]to get a new bi-quadratic maximal inequal- 
ity for permuted arrangements. We should also note that this martingale also has 
a potentially useful monotonicity property. Specifically, T k is monotone increasing, 
so by a little surgery on T k (say by replacing T k by k a T k ) will yield one a rich 
family of submartingales or supermartingales. 

The device used here to get the 4- vector martingale {M n : 1 < k < n — 2} can 
be extended in several ways. The most direct approach begins with 5| in addition 
to S%. In that case, linearization of the recursions requires one to introduce the 
terms S k T k and U k = Xf + X 2 + - ■ '+X%. The vectors {£ k } are then 7-dimensional, 
and, the matrix algebra is still tractable through symbolic computation, but it is 
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unpleasant to display. One can follow the construction and obtain seven martin- 
gales. Some of these are known, but many as four may be new. The ones with S k 
and SkTk are (almost) guaranteed to be new. 

Nevertheless, we do not pursue the seven-dimensional example here. Instead, for 
our second example, we apply the general construction to a simpler two-dimensional 
problem. In this example, the algebra is much more attractive, and the martingale 
that one finds has considerable flexibility. 



3. Second Example of the Construction 

As before, we assume that Xx,X2, • AT n is a sequential sample taken without 
replacement from X , and we impose the centering condition S n — M — 0. Further, 
we consider a system of "multipliers" ak, 1 < k < n where each ak is assumed to 
be an Tk-i measurable random variable. Since our measure space is finite, these 
non-anticipating random variables are automatically bounded. 

The basic building block for our next collection of martingales is the sequence 
of random variables defined by setting 



Wk = a\X\ + a 2 X 2 



akXk for 1 < k < n. 



The immediate task is to find a martingale that has Wk (or a deterministic multiple 
of Wk) as a summand. 

As before, we begin by calculating the one-step conditional expectations: 

E[Wk+i\Tk]=Wk-^rS k and E[S k+1 \T k ] = — - ~ ' 



If we introduce the vector rj k = (Wk, Sk) T we have 



-Sk 



/I 



E [Vk+i\Fk] = Ak+iVk where A k+1 







n — k 
n — k — 1 



n 



\ 
) 



Inversion is now especially easy, and we note that for 



/I 

V 



we have A, 



; / 

n-k+1 ' 

so induction again confirms the critical inverse: 

(l 







n — k 
n — k + 



it. 



- k 



) 



A 1 A 2 X ■■ ■ A k 1 







ft! + a 2 + h Ofc \ 



n — k 
n 



n 



The general recipe ([3]) then gives us a new martingale: 



(6) 



M k = Wk 



a-i + a 2 + • • • + a k 

n — k 



S k 



1 < k < n 



Once this is martingale is written down, one could also verify the martingale prop- 
erty by a direct calculation. In this instance, linear algebra has served us mainly 
as a tool for discovery. 
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Some of the benefits of the martingale ((6]) are brought to life through interesting 
choices for the non-anticipating factors {a k : 1 < k < n}. For, example, if we take 
a\ — and set a k = Xk~i for k > 2, we find a curious quadratic martingale 

M k = Xl X 2 + X 2 X 3 + ■■■ + X k . lXk + (Xl + --- +Xfc - l) f 1 + "- + Xfc) . 

n — k 

One is unlikely to have hit upon this martingale without the benefit of a systematic 
construction. For the moment, no application of this martingale comes to mind, 
but it does seem useful to know that there is such a simple quadratic martingale. 
Perhaps a nice application is not too far away. 

Out main purpose here is to expose the general construction (j3]), but, through the 
illustrations just given, we now have several martingales that speak directly to per- 
mutation inequalities — an extensive subject where martingales have traditionally 
been part of the toolkit. As we explore what can be done with the new martin- 
gales ([5]) and ©, we will also address some of the classic results on permutation 
inequalities. 

4. A Permutation Inequality 

In the first application, we quickly check what can be done with the martingale 
A/2, = {nSk — kM)/{n — k) given by (j4|) in our first construction. To keep the 
formulas simple, we impose a standing assumption, 

(7) S n = M = 0, 

so, Sk/{n — k) is an {J-fc} martingale with expectation zero. The Doob-Kolmogorov 
L 2 maximal inequality (see e.g. Shiryaev (1995, p. 493)) then gives us 

E max \(n- k^SJ 2 < AES 2 ^. 

l<k<n— 1 

This can be simplified by noting that 

ESf^ = E(M - X n f = EX* = EX\ = B/n, 
so, in the end, we have 

(8) E max \(n - k)' 1 S k \ 2 < AB/n. 

l<k<n—l 

One could immediately transcribe this as a permutation inequality, but first we put 
it in a form that seems more natural for applications. 

For any fixed permutation <r, the distribution of the vector (X±, X2, ■ ■ ■ , X n ) 
is the same as distribution the vector (X^m, X a (2), ■ ■ ■ ,X a ^) (cf. Feller (1971, 
p. 228)), so, in particular the distribution of {X\, X 2 , ■ ■ ■ ,X n ) is the same as the 
distribution of (X n , A„_i, . . . , X\). By the centering assumption ([7]), we know 

{X 1 + X 2 + ■ ■ ■ + X k f = (X k+1 + x k+2 + ■■■+ x n f, 

so applying both observations gives us the identity 

I —1 |2 2 

E max \(n — k)~ S k \ =E max \S k /k\ . 

l<fc<n-l 1 1 l<fe<n-l 

Using this bound in ([8]) then gives us 

(9) E max \S k /k\ 2 < AB/n, 

Kk<n—1 
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so by the sampling model and the definition of B, we come to an attractive inequal- 
ity for the maximum of averages drawn sequentially from a randomly permutated 
sample. 

Proposition 1 (Max- Averages Inequality). For real numbers {xi,X2, ...,x n } with 
x\ + X2 + ■ ■ ■ + x n = 0, we have 

(10) — max 
n\ ^ \<k< 

a 

This bound is so natural, it is likely to be part of the folklore of permutation 
inequalities, but we have been unable to locate it in earlier work. Still, even if the 
inequality is known, it seems probable that it has been under appreciated. In two 
examples in Section[5]we find that it provides an efficient alternative to other, more 
complicated tools. 

This inequality also bears a family relationship to an important inequality that 
originated with Hardy (1920). Hardy's inequality went through some evolution 
before it reached its modern form (see Steele (2004), pp. 169 and 290 for historical 
comments), but now one may write Hardy's inequality in a definitive way that 
underscores the analogy with (fTUl) : 

n ( _^ k \ 2 n 

(11) m f ^ 4 E : 

fc=i l i=i ) i=i 

It is well known (and easy to prove) that the constant in Hardy's inequality is best 
possible, and it is feasible that the constant in (fpQ|) is also best possible. We have 
not been able to resolve this question. 

5. Exchangeability and a Folding Device 

One also has the possibility of using exchangeability more forcefully; in par- 
ticular, one can exploit exchangeability around the center of the sample. For a 
preliminary illustration of this possibility, we fix 1 < m < n and note that the 
martingale property ([5]) of Mk gives us 

KS* —E -[B - T m ] = -[B - /•."/,„ . 

Ti—l n — 1 

The martingale property ^ of M^^ also gives us ET m — (m/n)B, so we have 

(12) ESI = ^p^ B ^ 

n{n — 1) 

a fact that one can also get by bare hands, though perhaps not so transparently. 

We can also use the martingale M 2 .k = {nS k — kM)/(n — k) = nSk/(n — k) 
here. Since we have M = 0, the L 2 maximal inequality for this martingale and the 
identity (fT2"j) give us 



xl 



(13) E max 

KKm 



Sk 



- k 



2 4ESI 4mB 



(n — m) 2 n(n — l)(n — m) 

Since (n — k) 2 < n _1 (n — 1) for 1 < k < m, the bound (|13[) implies the weaker, but 
simpler, bound 

(14) E max Si < for 1 < m< n. 

i<k<m n — m 
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The idea now is to use symmetry to exploit the fact that (|T4|) holds for many choices 
of to. To begin, we note that we always have the crude bound 

(15) E max S 2 k < E max S 2 k + E max S 2 k . 

l<k<n l<k<m m<k<n 

Typically this is useless, but here it points to a useful observation; we can use the 
same bounds on the second terms that we used on the first. This is the "folding 
device" of the section heading. 
Specifically, since M = we have 

\Sk\ = \S n — Sk\ = \x. n + X n -i + • • • + X k +i\, 

so, by exchangeability, we see that the second summand of (|15[) is bounded by 
4(n — m)B/m. Thus, we have for all 1 < m < n that 

(16) E max S% < E max Si + E max S% < AB 

l<k<n l<k<m m<k<n 

Here, if we just consider n > 8 and choose to as close as possible to n/2, then we 
see that the worst case occurs when n = 9 and to = 4; so we have the bound 

(17) E max S% < E max Sf + E max S% < (41/5)B for n > 8. 

l<fc<n l<fc<m m<k<n 

On the other hand, Cauchy's inequality gives us S 2 < kB < nB, so the bound 
(fl"7|) is trivially true for n < 7. Combining these ranges gives us a centralized 
version of an inequality originating with A. Garsia (cf. Stout (1974, pp. 145-148)). 

Proposition 2 (A. Garsia). For any set of real numbers {x\, X2, x n } with sum 
%i + %2 H h x n = we have 

^ k n 

a i—1 i—1 

where the sum is taken over all possible permutations of {xi,X2, ...,x n }. 

Here we have paid some attention to the constant in this inequality, but already 
from the crude bound (|15p one knows that the present approach is not suited to the 
derivation of a best possible result. Our intention has been simply to illustrate what 
one can do with reasonable care and robust approach that uses the tools provided 
by our general martingale construction. 

Nevertheless, the constant in this inequality has an interesting history. The 
bound seems first to have appeared in Garsia (1968, Theorem 3.2) with the constant 
9. Curiously, the inequality appears later in Garsia (1970, eqn. 3.7.15, p. 91) where 
the constant is given as 16, and for the proof of the inequality one is advised 
to "following the same steps" of Garsia (1964). In each instance, the intended 
applications did not call for sharp constants, so these variations are scientifically 
inconsequential. Still, they do make one curious. 

Currently the best value for the constant in the Garsia inequality flT8"|) is due 
to Chobanyan (1994, Corollary 3.3) where the stunning value of 2 is obtained. 
Moreover, Chobanyan and Salehi (2001, Corollary 2.8)) have a much more general 
inequality which also gives a constant of 2 when specialized to our situation. 

Here we should also note that in all inequalities of this type have both a central- 
ized version where M = and non-centralized version where M is unconstrained. 
One can pass easily between the versions (see e.g. Stout (1974, p. 147) or Garsia 
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(1970, p. 93)). The centralized versions are inevitably simpler to state, so we have 
omitted discussions of the non-centralized versions. 



6. Quadratic Permutation Inequality 



To apply the L 2 maximal inequality to the martingale {Mj~ : 1 < k < n — 2} 
that was discovered by our first construction ([5]), one needs a comfortable estimate 
of the bounding term AE[M 2 _ 2 ] in the Doob-Kolmogorov inequality. There are 
classical formulas for the moments for sampling without replacement that simplify 
this task. These formulas were known to Isserlis (1931), if not before, but they are 
perhaps easiest to derive on one's own. 

Lemma 3. IfX\, X2, X3, X4 are draw without replacement from X — {x\, X2, x n } 
where x\ + X2 + • • • + x n — 0, then we have the moments 



£(XiX 2 M 4 ) 



3B 2 - 6Q 



n(n- l)(n-2)(n-3)' 



E(X\X2Xz 



2Q - B 2 



E{X\Xl) = 



B 2 -Q 



E(XfX 2 ) = - 



n(n-l)(n-2)' 
E(Xt) = Q/n, 



n(n — 1) ' n(n — 1) ' 

where, as usual, we set B = x\ + x\ + ■ ■ ■ + x^ and Q = xf + x\ + • • • + 
Now, to calculate AE[M 2 _ 2 ], we first note that 

S n -i = —X n and S'n-2 = — X n _\ — X n , 
so just expanding the definition of M n _2 gives us 

AE[Ml_ 2 ] =E \{(n - l)Sl_ 2 - (n - 2){B - T„_ 2 )} 



--E 



{(n - l)(X n _a + X n f - (n - 2)(X 2 _ 1 + X 2 n )Y 



=E[X*_ 1 + X* + (An 2 -8n + 6)X 2 ^X 2 
+ 4(n - m^Xn + 4(n - l)X n ^X^] . 

By Lemma [3] and exchangeability, one then finds after some algebra that the L 2 
maximal inequality takes the form 



(19) E max M 2 < 4E[M 2 _ 2 ] 

Kk<7i — 2 



An 2 - 8n + 6 „ 2 



n(n — 1) 



-B 1 



An 2 



2n 



n(n — 1) 



The only task left is to reframe this inequality so that it easy to apply as a permu- 
tation inequality. Here it is useful to observe that 



(20) 



An 2 



8n + 6 „ , An 2 
< A and 



2n 



n(n — 1) 



n(n — 1) 



> 4 



for n > 2, 



moreover, these are not wasteful bounds; they are essentially sharp for large n. 
When we apply these bounds in (fl9|) we get the nicer bound, 



(21) 



E max 

Kfe<n-2 



(n - l)S 2 - k(B - T k ) 



(n — k)(n — k — 1) 



< 4[B 2 - Q] 
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This brings us closer to our goal, but further simplification is possible if we note 
that by reverse sampling the left side of (|21|) can also be written as 



E max 

Kfe<n-2 



(n - 



(n — k)(n — k — 1) 



-E max 

2<fc<n-l 



(n - l)S 2 k - (n - fc)T fc 



fc(fc - 1) 



-E max 

2<fc<ri 



(n - l)Sl -{n- k)T k 



k{k - 1) 

In permutation terms, the bound (1211) establishes the following proposition. 

Proposition 4 (Quadratic Permutation Inequality). For any set of real numbers 
{xi, 22, x„} wii/i xi + X2 + • • • + x n = we have 

2 



— - max 

TV. Z —'2<k<n 



Ei=l 



'<r(i) 



_ 7i-fc v^fc 2 
n— 1 ^i=l ^it 



(i) 



k(k- 1) 




»=i 



where the first sum is taken over all possible permutations of {xi, X2, x n }. 

At first glance, this may seem complicated, but the components are all readily 
interpretable and the inequality is no more complicated than it has to be. The main 
observation is that our general construction ([2]) brought us here in a completely 
straightforward way. Without such an on-ramp, one is unlikely to imagine any 
bound of this kind — and relative simplicity. Moreover, the inequality does have 
intuitive content, and this content is made more explicit in the next section. 



7. The Discrete Bridge and Further Folding 

Here we set Xi = 1 for 1 < i < m and take Xi = — 1 for m + 1 < i < 2m. We 
then let X$, 1 < i < 2m, denote samples that are drawn without replacement from 
the set X = {xi, X2, . . . , xi m \. If we put Sq = and denote the usual partial sums 
by Sk, 1 < k < 2m, then S 2 m = and the process {Sk : < k < 2m} is a discrete 
analog of the Brownian bridge. Alternatively, one can view this process as simple 
random walk that is conditioned to return to at time 2m. 

For {Mfc}, the martingale from the first construction ([5]), we now have T k = k, 
so we have the simple representation 

(2m - l)Sf - (2m - k)k 



(22) 



M k = 



1 < k < 2m - 2. 



(2m - k){2m - k - 1) ' 

By Lemma[3]with B = Q = 2m and by p2[) . we also find that at the (left) mid-point 
m of our process we have the nice relations 

3m 4 — 4m 3 



(23) 



E[S'J = m 2 /(2m - 1) and E[S^} = 



4m 2 — 8m + 3 



These give us a rational formula for E[M^], but for the moment, we just use the 
partial simplification 

m 2 

E\Ml 1 = E 



(2m — 1)5^, — (2m — m)m 



(2m — m)(2m — m — 1) 
' 1 '"m - VfESi 



m?(m — 1) 



m 



10 



POZDNYAKOV AND STEELE 



For 1 < k < m we have the trivial bound 

(2m) (2 (m - 1)) 



1 < max 

Kk<m 



(2m - k)(2m - k - 1) 
so the L 2 maximal inequality applied to the martingale (I22[) gives us 
(24) E max I (2m - - (2m - k)k? < 64 [(2m - 1) 2 ES$ 

l<k<m 



m 4 l 



We can now take advantage of a symmetry that is special to the discrete bridge. 
Since S^m = we have {X\ H h^) 2 = (X 2m H h -X~fe+i) 2 i and by exchange- 
ability the vectors (Xi, . . . , X ro ) and (-X2 m , . . . , X m +±) have the same distribution. 
Also, the value (2m — k)k is "invariant" in the following sense: if we substitute for 
k (the number of summands in X\ + X 2 + • • -+X k ) the value 2m — k (the number of 
summands in X<z m + X-i m -\ + ■•• + X^+i), then the symmetric quantity (2m — k)k 
is unchanged. As a consequence, the random variables 

max | (2m — l)^ — (2m — fc)fc| 2 and max | (2m — l)S k — (2m — k)k\ 2 

are equal in distribution. 

We can then apply ([24]) twice to obtain 

E max \(2m-l)Sl-(2m-k)k\ 2 < 12% \(2m- I) 2 E St- m A1 \. 

l<fc<2m-l |V J k \ ' \ — J 

From this bound and the formula (1231) for ESz, , we then have 



E max 

KK2m-l 



2m - 1 



4m 3 — 8m 2 + 4m 

< 198 m 

" 8m 3 - 20m 2 + 14m - 3 



2 



and, for any m > 2 we have (4m 3 — 8m 2 + 4m)/(8m 3 — 20m 2 + 14m — 3) < 1. In 
the end, we have the following proposition. 

Proposition 5 (Quadratic Permutation Inequality for Discrete Bridge). For the set 

X = {xi, X2, . . . , X2m} with Xi = 1 for 1 < i < m and Xi = — 1 /or m + 1 < i < 2m 
one has 



(25) ttt-tt ^ m ax 

(2m)! ^— ' i<fe<2m-i 




2 



2m — fc 



2m - 1 



2 



< 128m 2 , 



where the first sum is taken over all possible permutations. 

For the terms of the squared discrete bridge process {S* 2 : < i < 2m} have the 
expectations ESf = k(2m — k)/(2m — 1), so (1231) gives us a rigorous bound on the 
maximum deviation of the square S 2 of a discrete bridge from its expected value 
ES 2 . Here, the order 0(m 2 ) of the bound cannot be improved, but, if necessity 
called, one may be able to improve on ungainly constant 128. 

8. Permuted Sums with Fixed Weights 
In Section |3l we considered the weighted sums 

W k = aiXx + a 2 X 2 H h a k X k for 1 < k < n. 

and we found that the process 

(26) M k = W k + ai+a2 + -;- +ak S k l<k<n 

n — k 
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is a martingale whenever the multipliers are non-anticipating random variables (i.e. 
whenever a k is J^-i-measurable for each 1 < k < n). Here we will show that this 
martingale has informative uses even when one simply takes the multipliers to be 
fixed real numbers. 

First we introduce some shorthand. For 1 < k < n we will write 



(27) 



a,\(k) = a\ + a 2 + • • • + a k and a 2 (fc) = a\ + a\ + • • • + a k . 



Also, we will be most interested in multiplier vectors a — (<2i, a 2 , . . . , a n ) for which 
we have some control of the quantity 



(28) 



V n (a) == max ct^ (fc) /a 2 (n) 

Kfe<n— 1 



which is a measure of cancelation among the multipliers. A leading example worth 
keeping in mind is the sequence a k = (— l) fe+1 , 1 < k < n, for which we have 
V n {a) — 1/n. We will revisit the measure V n (a) after we derive a moment bound. 

Lemma 6. For the martingale {Mfc} defined by equation (|26|) . we have 



(29) 



E[Mi] = 



1 



-a 2 (k)B 



n - 1 

Proof. Simply squaring (|26p . we have 



(n — l)(n — k) 



a\{k)B. 



\M k \" = Wl + ^Lst + 2 r 

[n — fc) n — k 



W k S k , 



so we just need to find EW%, ES% and EW k S k . We know from (|T2"j) . and we 
have BX| = B/n and EXiXj = -B/(n(n - 1)) so 

k 



EWt =E 



2 y2 



<* 2 (A;)— + [a?(fe) -a 2 (fc)] 



E 

l<l,j<k,i^j 



aiajXiXj 



B 



n(n — 1) 



a 2 (k) 



B 



n - 1 



al(k) 



B 



n(n — 1) 



Similarly, we have 



k 



k 



EW k S k =E [ <kXf + J2 a * E X * X - 

i=l 



B 



i=i i<j<k,j^i 



B 



= ai(k) 



=ax(k) (fc-l)ai(fc) 

n \_n{n — 1J 

so summing up the terms completes the proof of the lemma 

From this lemma, the L 2 maximal inequality gives us 
(30) 



(n - k)B 
n(n — 1) 



□ 



B 



E max Mt <4£;[M„_i] 2 = 4[a 2 (n - 1) + af(n - 1)] 

l<fe<n— 1 n — 1 

but to extract real value from we need to relate it to the weighted sum W k . Using 
(|2"BT) we can write W k as the difference between M k and a\{k)S k /{n — k), so from 
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the trivial bound (x + y) 2 < 2x 2 + 2y 2 we have 



max \W k \ 2 < 2 max \M k \ 2 + 2 max a 2 (k) 

Kk<n-l ' Kfc<n-1 Kfc<n-1 



<2 max l-Mtl +2 max oa(k) ■ max 

Kfc<n-1 Kfc<n-1 Kfc<n-1 



The second step may look wasteful, but now we can apply both (|30|) and the Max- 
Averages inequality (|8]) from Section [4] to obtain 



E max |Wfc| 2 <8 a 2 (n - 1) + af (n - 1) 

l<fc<n— 1 L 

Note that we also have 



-+8 max an (k) — . 

n — 1 i<fe<n-i n 



Kk<7 



max |Wfc| = max max |Wfc| , (W n -i + a„X„) 



Kfe<n-1 



< max max \W k \ , 2W 2 _ 1 + 2a 2 n X; 
<2 max \W k \ 2 + 2a 2 n X 2 , 

Kk<n—1 



and the bottom line is that 
(31) 



e\ max \W k \ 2 \ = 4V max 

[l<fe<n J n\ ^ \<k<n 



^2 a ^ X °(i) 



is bounded by the lengthy (but perfectly tractable) sum 

D 



(32) 



1G 



0:2(71 — 1) + a 1 (n — 1) 



n — 1 



B 

16 max a?(fc) Y2a 2 —. 

Kfc<n-i n n 



To make this concrete, note that for a k = (—l) k+1 the bound on (|3Tj) that we get 
from (|32|) is simply (16n/ (n— 1) + 18/n)-B. The ratio 16n/(n — 1) + 18/n decreases 
to 16, and for n = 18 the upper bound is equal to (17 + 16/17)5. By Cauchy's 
inequality, we have \W k \ 2 < kB for all k, so the bound (17 + 16/17)i? also holds 
for all 11 < 17. When we assemble the pieces, we have a permutation maximal 
inequality for sums with alternating signs. 



Proposition 7. For real numbers {xi,X2, ...,x n } with x\ + X2 + 
have 



we 



(33) 



a i—1 y ' i—1 



where the sum is taken over all permutations of {x\, X%, £„}. 

The argument that leads to (1331) is useful for more than just alternating sums; 
it has bite whenever there is meaningful cancelation in a = (a±, 02, ... , a n ). Specif- 
ically, the bound ([32]) is always dominated by 

160:2(11) +32 max a\{k), 

Kk<n— 1 



so our proof also gives us more general — and potentially more applicable — bound. 
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Proposition 8. For sets of real numbers {ai, 012, a n } and {xi, X2, • x n } such 
that x\ + X2 + • • • + x n = we have 

1 fc 16 n n 

( 34 ) n\ £ i<\ a < X „ 1 E I 2 < + 2K(a)} E a? ]T a;?, 

cr i—1 i—1 i—1 

where the sum is taken over all permutations of {x\, x%, x n } and where V n (a) 
defined in (|28|) . 

This inequality shows that there are concrete benefits to introducing V(a). For 
example, by a sustained and subtle argument, Garsia (1970, 3.7.20, p. 92) arrived 
at a version of our inequality where the coefficient 16{1 + 2V n (a)} is replaced by 
80. Now, for uniform multipliers, one has V n (a) = O(n) and Garsia's inequality 
is greatly superior to our bound (l34l) . On the other hand, for multipliers that 
satisfy the cancelation property V n (a) — > 0, the present bound eventually meets 
and beats Garsia's bound. In particular, for multipliers given by alternating signs, 
the constant of is just 17 + 16/17. 



9. Weighted Sums and Folding 



By many accounts, the permutation maximal inequality of Garsia (1970, p. 86) 
is the salient result in the theory of permutation inequalities, so it is a natural 
challenge to see if it can be proved by the robust martingale methods that follow 
from our martingale constructions. We give a proof of this kind — without appeal 
to V n (a). Once the proof is complete, we address the differences between Garsia's 
inequality and the present bound with its curious constant. 

Proposition 9. For real numbers {a\, 02, a n } and real numbers {x\, x 2 , x n } 
such that x\ + x% + • • ■ + x n = we have 



(35) 



1 k 
— > max } 

nl ^— ' Kk<n 1 ^ 



Q>iX r 



Wl 



< (80 + — )YalYx 2 
~ v 205 ; ^ 1 ^ 

i=l i—1 



where the sum is taken over all possible permutations of {x\,X2, ...,x n }. 

Proof. First, for all 1 < k < n, Cauchy's inequality gives us a\(k) < fca^fc). 
Trivially one has 0:2 (k) < 0:2 (n), so by Lemma [HI we have the bound 



(36) 



E[M k ] 2 < 1 



a 2 (n)B 



1 



for 1 < k < n — 1 . 



Now, just from the definition ([26]) of Mk, we can write Wk as a difference between 
M k and ax(k)Sk/ (n — fc). We can then use the crude bound (x + y) 2 < 2x 2 + 2y 2 
and Cauchy's inequality to get 



max \W k \ 2 <2 max \M k \ 

Kk<rn Kk<m 



< 2 max \M k \ 

Kk<rn 



ai(k) 



n — k 
2 



2 max a 1 (k) 

Kk<rn 



s k 



<2 max \M k \ + (2m) 0*2(71) max 

Kk<m Kk<m 



s k 
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Long ago, in (|12[) . we calculated ES 2 i: so here we can apply the I? maximal 
inequality to both martingales {Sk/{n — k)} and {Mfe} to get the bound 

^ 2 ^ a 2 {n)B 



(37) E max W k z < 8 1 + + — zy '- 

i<k<m \ n — m n[n — m) J n — 1 

Such an inequality for 1 < m < n suggests the possibility of folding. To pursue this 
we hrst note 



max Wu = max 

Kk<n 



max Wj:, max Wi 



Kk<r 



m<k<n 



max W k , max | W m + 

0>m-\- 

lX m +i 



Kk<m 



m<k<n 



dkX k \ 



< max 



max Wk, 2W, 



Kk<r 



2 max |o m+1 X m+1 + 

m<k<n 



&kXk\ 



(38) 



< 2 



max Wu 

Kk<m 



max \a m +iX m+ i 

m < k < n 



a-kX k \ 



By exchangeability, one expects the second maximum has a bound like the one 
derived in (|37[) . To make this explicit, one simply needs to replace m by n — m in 
the upper bound of (|3"T|) . Doing so gives us the sister bound 

\2~ 



E max \a m +\X m+ i 

m<fc<n 



■a k X k Y <8 1 



(n — m) (n — m) \ a2(n)B 



By our bounds on the two addends of 



E max Wi < 16 2 



Kfe<r 



77! 

we then have 

2 

m 



1 



(n — m,) 2 \ a 2 (n)B 



n(n 



m) 



It only remain to take m = \_n/2\ and to attend honestly to the consequences. 
If n is even, the constant that multiplies a2(n)B / (n — 1) is exactly 80. For odd n, 
the constant approaches 80 from above as n increases to infinity, and for n = 81 
the constant is 80 + 4/205. Furthermore, Cauchy's inequality gives us 

W^ < a 2 (k)B < a 2 (n)B for all 1 < k < n, 

and we have a 2 (n)B < (80 + 4/205)a 2 (n)B/(n - 1) for all n < 81. So, in the end, 
we come to (|35p . our permutation maximal inequality with general weights. □ 

We would greet this result with some fanfare except that Garsia (1970, 3.7.20, 
p. 92) gives this bound with the constant 80. Still, one needs to keep in mind that 
we have pursued this derivation only to illustrate the usefulness of the martingales 
that are given by our general linear algebraic construction. Perhaps it is victory 
enough to come so close to a long-standing result that was originally obtained by 
a delicate problem specific, argument. 

Compared to Garsia's argument, the proof of (|35l) is straightforward. It is also 
reasonably robust and potentially capable of further development, even though 
there seems to be no room to improve the constant. In spirit the proof is close 
to the elegant argument of Stout (1974, pp. 145-148) for his version of the easier 
unweighted inequality (Proposition [2]) . In each instance, the heart of the matter is 
the application of the maximal L 2 inequality to some martingale. Here we have the 
benefit of ready access to the martingales (j4j and ((6|) that were served up to us by 
our general construction. 
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10. Observations and Connections 

Our focus here is on methodology, and our primary aim has been to demonstrate 
the usefulness of a linear algebraic method for constructing martingales from the 
basic materials of sampling without replacement. Through our examples we hope 
to have shown that the martingales given by our general construction have honest 
bite. In particular, these martingales yield reasonably direct proofs of a variety of 
permutation inequalities — both new ones and old ones. 

Among our new inequalities, the simple Hardy- type inequality (I10[) seems par- 
ticularly attractive. If we had to isolate a single open problem for attention, then 
our choice would be to determine if the constant of inequality (flO|) is best possible. 
This problem seems feasible, but one will not get any help from the easy arguments 
that show that the corresponding constant in Hardy's inequality is best possible. 

The other new inequality that seems noteworthy is the Garsia-type inequality ([8"jl 
where we introduce V(a), the measure of multiplier cancelations. This inequality 
may be long-winded, but it isolates a common situation where one can do sub- 
stantially better than the classic Garsia bound (|35j) . The quadratic permutation 
inequality (Proposition 0]) and the discrete bridge inequality ([25]) are more special- 
ized, and they may have a hard time finding regular employment. Still, they are 
perfect for the right job, and they also illustrate the diversity of the martingales 
that come from the general construction. 

We have developed several results in theory of permutation inequalities to test 
the effectiveness of the permutation martingales given by our construction, but 
permutation inequalities have a charm of their own, and one could always hope 
to do more. We have already mentioned the remarkable maximal inequalities of 
Chobanyan (1994) and Chobanyan and Salehi (2001) that exploit combinatorial 
mapping arguments in addition to martingale arguments. It would be interesting 
to see if our new martingales could help more in that context. Finally, we did not 
touch on the important weak-type (or Levy-type) permutation inequalities such as 
those studied in Pruss (1998) and Levental (2001), but it seems reasonable to expect 
that the martingales (@| and ([5]) could also be useful in the theory of weak-type 
inequalities. 
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